Dynamic Robust Speech Recognition
نویسندگان
چکیده
Robust recognition theory has become one of research focuses of acoustic speech recognition. Acoustic speech digital signal is a random process repeatedly alternating stationary pieces with non-stationary pieces. However both the current linear and stationary characteristic parameters drawn from such signals and the rigid recognition models do not adapt to such repeatedly alternating property of acoustic speech. Though Missing Feature Approach (MFA) has been proved a considerable solution of enhancement of robustness for noisy speech, MFA classifying in binary way seems to be rough and it cannot used to deal with cepstral feature. Consequently, current noisy speech recognition systems perform mostly poorly. This paper tries to set up dynamic recognition theory that applies non-linear doubly random time series instead of linear model and auto-select types of parameters and recognition models based on nonstationary measure of real-time. Meanwhile this theory gives two approaches of Feature with Confident Weight (FCW) in three means to describes the effect of noise in a more precise way and available in cepstral domain. Experimental results show proposed approaches could improve the recognition accuracy significantly in adverse environment, including stationary and non-stationary noise environments.
منابع مشابه
An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملAn Efficient Block-based Dynamic Range Adjustment Method in Noise-robust Continuous Speech Recognition
This paper proposes a new technique for speech feature estimation under noise circumstances. This new approach yields noise-robust continuous speech recognition (CSR). Noiserobust techniques for isolated word speech recognition typically employ the running spectrum analysis (RSA), the running spectrum filtering (RSF) and the dynamic range adjustment (DRA) methods. Among them, only RSA has been ...
متن کاملRobot Arm Performing Writing through Speech Recognition Using Dynamic Time Warping Algorithm
This paper aims to develop a writing robot by recognizing the speech signal from the user. The robot arm constructed mainly for the disabled people who can’t perform writing on their own. Here, dynamic time warping (DTW) algorithm is used to recognize the speech signal from the user. The action performed by the robot arm in the environment is done by reducing the redundancy which frequently fac...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملRobust Speech Recognition with MSC/DRA Feature Extraction on Modulation Spectrum Domain
This report introduces noise robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control)/DRA (Dynamic Range Adjustment). The dynamic range of cepstrum obtained from noisy speech is usually smaller than that from the same speech without noise since some speech features are hidden in noise. This difference may cause recognition errors. Theref...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006